Learning to recognize actionable static code warnings (is intrinsically easy)

نویسندگان

چکیده

Static code warning tools often generate warnings that programmers ignore. Such can be made more useful via data mining algorithms select the “actionable” warnings; i.e. are usually not ignored. In this paper, we look for actionable within a sample of 5,675 seen in 31,058 static from FindBugs. We find with remarkable ease. Specifically, range methods (deep learners, random forests, decision tree and support vector machines) all achieved very good results (recalls AUC(TRN, TPR) measures over 95% false alarms under 5%). Given these learners succeeded so easily, it is appropriate to ask if there something about task inherently easy. report while our sets have up 58 raw features, those features approximated by less than two underlying dimensions. For such intrinsically simple data, many different kinds models similar performance. Based on above, conclude learning recognize easy, using wide algorithms, since simple. If had pick one particular learner task, would suggest linear SVMs (since, at least sample, ran relatively quickly best median performance) recommend deep (since simple).

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Is code equivalence easy to decide?

We study the computational difficulty of deciding whether two matrices generate equivalent linear codes, i.e., codes that consist of the same codewords up to a fixed permutation on the codeword coordinates. We call this problem Code Equivalence. Using techniques from the area of interactive proofs, we show on the one hand, that under the assumption that the polynomial-time hierarchy does not co...

متن کامل

Is Code Equivalence Easy to Decide?

متن کامل

BYTEWEIGHT: Learning to Recognize Functions in Binary Code

Function identification is a fundamental challenge in reverse engineering and binary program analysis. For instance, binary rewriting and control flow integrity rely on accurate function detection and identification in binaries. Although many binary program analyses assume functions can be identified a priori, identifying functions in stripped binaries remains a challenge. In this paper, we pro...

متن کامل

Learning to recognize plankton

We present a system to recognize underwater plankton images from the Shadow Image Particle Profiling Evaluation Recorder. As some images do not have clear contours, we develop several features that do not heavily depend on the contour information. A soft margin support vector machine (SVM) was used as the classifier. We developed a new way to assign probability after multi-class SVM classificat...

متن کامل

Learning to recognize objects.

Evidence from neurophysiological and psychological studies is coming together to shed light on how we represent and recognize objects. This review describes evidence supporting two major hypotheses: the first is that objects are represented in a mosaic-like form in which objects are encoded by combinations of complex, reusable features, rather than two-dimensional templates, or three-dimensiona...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Empirical Software Engineering

سال: 2021

ISSN: ['1382-3256', '1573-7616']

DOI: https://doi.org/10.1007/s10664-021-09948-6